Complexity of Constrained VC-Classes

نویسنده

  • Joel Ratsaby
چکیده

Let F be a class of n-dimensional binary vectors, i.e., functions f : X → {0, 1} where X = [n] ≡ {1, . . . , n} with a VC-dimension V C(F) = d. The classical result of Sauer says that the complexity of F is bounded as |F| ≤ d i=0 n i ≡ S(d, n). How does the complexity decrease as one further constrains the subset of allowed functions in F ? The paper defines a constraining parameter for binary functions, called the margin μh(x, y) of h at x ∈ [n], which is a form of confidence that h takes the value y at x. Let a sample ζ = {(xi, yi)} l i=1, xi ∈ [n], yi ∈ {0, 1}, and for N ≥ 0, consider a class HN (ζ) ⊆ F of functions h having μh(xi, yi) > N , 1 ≤ i ≤ l. The above question is answered by estimating the cardinality |HN (ζ)| as a function of the margin parameter N by E1 = 1+exp(−(l+2(N +1))/n)S(d, n). In the extreme case, where ζ is the maximal-size sample on which every h ∈ HN(ζ) has μζ(h) > N , the estimate is E2 = exp(− exp(−(2N + 1)))(1 + exp(−(l + 2(N + 1))/n))S(d, n)). The latter is exponentially smaller than E1 in the region of N < N ′, where N ′ is approximately (1/2) ln(n).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sauer's Bound for a Notion of Teaching Complexity

This paper establishes an upper bound on the size of a concept class with given recursive teaching dimension (RTD, a teaching complexity parameter.) The upper bound coincides with Sauer’s well-known bound on classes with a fixed VC-dimension. Our result thus supports the recently emerging conjecture that the combinatorics of VC-dimension and those of teaching complexity are intrinsically interl...

متن کامل

Complexity of VC-classes of sequences with long repetitive runs

The Vapnik-Chervonenkis (VC) dimension (also known as the trace number) and the Sauer-Shelah lemma have found applications in numerous areas including set theory, combinatorial geometry, graph theory and statistical learning theory. Estimation of the complexity of discrete structures associated with the search space of algorithms often amounts to estimating the cardinality of a simpler class wh...

متن کامل

On the complexity of constrained VC-classes

Sauer’s Lemma is extended to classes HN of binary-valued functions h on [n] = {1, . . . , n} which have a margin less than or equal to N on all x ∈ [n] with h(x) = 1, where the margin μh(x) of h at x ∈ [n] is defined as the largest non-negative integer a such that h is constant on the interval Ia(x) = [x− a, x+ a] ⊆ [n]. Estimates are obtained for the cardinality of classes of binary valued fun...

متن کامل

Recursive teaching dimension, VC-dimension and sample compression

This paper is concerned with various combinatorial parameters of classes that can be learned from a small set of examples. We show that the recursive teaching dimension, recently introduced by Zilles et al. (2008), is strongly connected to known complexity notions in machine learning, e.g., the self-directed learning complexity and the VC-dimension. To the best of our knowledge these are the fi...

متن کامل

Recursive Teaching Dimension, Learning Complexity, and Maximum Classes

This paper is concerned with the combinatorial structure of concept classes that can be learned from a small number of examples. We show that the recently introduced notion of recursive teaching dimension (RTD, reflecting the complexity of teaching a concept class) is a relevant parameter in this context. Comparing the RTD to self-directed learning, we establish new lower bounds on the query co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005